Elliptic curve encryption systems

ABSTRACT

An elliptic curve encryption system represents coordinates of a point on the curve as a vector of binary digits in a normal basis representation in F_(2^m). A key is generated from multiple additions of one or more points in a finite field. Inverses of values are computed using a finite field multiplier and successive exponentiations. A key is represented as the coordinates of a point on the curve and key transfer may be accomplished with the transmission of only one coordinate and identifying information of the second. An encryption protocol using one of the coordinates and a further function of that coordinate is also described.

This is a divisional of application Ser. No. 08/790,987, filed Jan. 29, 1997, now U.S. Pat. No. 6,141,420, which is a continuation of international application No. PCT/CA95/00452, filed Jul. 31, 1995, which is a continuation-in-part of application Ser. No. 08/282,263, filed Jul. 29, 1994, now abandoned.

FIELD OF THE INVENTION

The present invention relates to public key cryptography.

The increasing use and sophistication of data transmission in such fields as telecommunications, networking, cellular communication, wireless communications, “smart card” applications, audio-visual and video communications has led to an increasing need for systems that permit data encryption, authentication and verification.

It is well known that data can be encrypted by utilizing a pair of keys, one of which is public and one of which is private. The keys are mathematically related such that data encrypted with the public key may only be decrypted with the private key and conversely, data encrypted with the private key can only be decrypted with the public key. In this way, the public key of a recipient may be made available so that data intended for that recipient may be encrypted with the public key and only decrypted by the recipient's private key, or conversely, encrypted data sent can be verified as authentic when decrypted with the sender's public key.

The most well known and accepted public key cryptosystems are those based on integer factorization and discrete logarithms in finite groups. In particular, the RSA system for modulus n=p·q where p and q are primes, the Diffie-Hellman key exchange and the ElGamal protocol in Z*_(p), (p a prime) have been implemented worldwide.

The RSA encryption scheme, where two primes p and q are multiplied to provide a modulus n, is based on the integer factorization problem. The public key e and private key d are related such that their product e·d equals 1 (mod φ) where φ=(p−1)(q−1). A message M is encrypted by exponentiating it with the public key e to the modulus n [C=M^(e) (mod n)] and decrypted by exponentiating with the private key d mod n [M=C^(d) (mod n)]. This technique requires the transmission of the modulus n and the public key, and the security of the system is based on the difficulty of factoring a large number that has no relatively small factors. Accordingly both p and q must be relatively large primes.
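By way of illustration only, the relationship between the public exponent, the private exponent and the modulus may be sketched with textbook-sized primes. The values p=61, q=53 and e=17 are assumptions chosen for readability and offer no security; the function names are hypothetical.

```python
# Toy RSA round trip (illustrative sizes only; practical primes are far larger).
p, q = 61, 53
n = p * q                    # modulus n = p*q
phi = (p - 1) * (q - 1)      # phi = (p-1)(q-1)
e = 17                       # public exponent, chosen coprime to phi
d = pow(e, -1, phi)          # private exponent with e*d = 1 (mod phi); Python 3.8+

def encrypt(message: int) -> int:
    """C = M^e (mod n), computed with the public key."""
    return pow(message, e, n)

def decrypt(cipher: int) -> int:
    """M = C^d (mod n), computed with the private key."""
    return pow(cipher, d, n)

message = 65
cipher = encrypt(message)
recovered = decrypt(cipher)
```

Only n and e need be published; recovering d from them requires φ, and hence the factors p and q.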

One disadvantage of this system is that p and q must be relatively large (at least 512 bits) to attain an adequate level of security. With the RSA protocol this results in a 1024 bit modulus and a 512 bit public key which require significant bandwidth and storage capabilities. For this reason researchers have looked for public key schemes which reduce the size of the public key. Moreover, recent advances in analytical techniques and associated algorithms have rendered the RSA encryption scheme potentially vulnerable and accordingly raised concerns about the security of such schemes. This implies that larger primes, and therefore a larger modulus, need to be employed in order to maintain an acceptable level of security. This in turn increases the bandwidth and storage requirements for the implementation of such a scheme.

Since the introduction of the concept of public key cryptography by Diffie and Hellman in 1976, the potential for the use of the discrete logarithm problem in public key cryptosystems has been recognized. In 1985, ElGamal described an explicit methodology for using this problem to implement a fully functional public key cryptosystem, including digital signatures. This methodology has been refined and incorporated with various protocols to meet a variety of applications, and one of its extensions forms the basis for a proposed U.S. digital signature standard (DSS). Although the discrete logarithm problem, as first employed by Diffie and Hellman in their public key exchange algorithm, referred explicitly to the problem of finding logarithms with respect to a primitive element in the multiplicative group of the field of integers modulo a prime p, this idea can be extended to arbitrary groups (with the difficulty of the problem apparently varying with the representation of the group).

The discrete logarithm problem assumes that G is a finite group, and a and b are elements of G. Then the discrete logarithm problem for G is to determine a value x (when it exists) such that a^(x)=b. The value for x is called a logarithm of b to the base of a, and is denoted by log_(a)b.

The difficulty of determining this quantity depends on the representation of G. For example, if the abstract cyclic group of order m is represented in the form of the integers modulo m, then the solution to the discrete logarithm problem reduces to the extended Euclidean algorithm, which is relatively easy to compute. However, the problem is made much more difficult if m+1 is a prime, and the group is represented in the form of the multiplicative group of the finite field F_(m+1). This is because the computations must be performed according to the special calculations required for operating in finite fields.
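The dependence on representation may be sketched as follows, assuming a group order m=1018 (so that m+1=1019 is prime) and hypothetical function names. In the additive representation a single modular inverse yields the logarithm, whereas the multiplicative representation is searched exhaustively here, a deliberately naive stand-in for the more elaborate attacks required in practice.

```python
def dlog_additive(a: int, b: int, m: int) -> int:
    """Solve x*a = b (mod m) in the integers modulo m, assuming gcd(a, m) = 1."""
    return (b * pow(a, -1, m)) % m      # one inverse via the extended Euclidean algorithm

def dlog_multiplicative(a: int, b: int, p: int) -> int:
    """Solve a^x = b (mod p) by exhaustive search; the cost grows with the group order."""
    value = 1
    for x in range(p - 1):
        if value == b:
            return x
        value = (value * a) % p
    raise ValueError("b is not a power of a")

m = 1018                                # group order; m + 1 = 1019 is prime
secret = 777

# Same abstract cyclic group, two representations.
x_add = dlog_additive(3, (secret * 3) % m, m)
target = pow(2, secret, m + 1)
x_mul = dlog_multiplicative(2, target, m + 1)
```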

It is also known that by using computations in a finite field whose members lie on an elliptic curve, that is by defining a group structure G on the solutions of y²+xy=x³+ax²+b over a finite field, the problem is again made much more difficult because of the attributes of elliptic curves. Therefore, it is possible to attain an increased level of security for a given size of key. Alternatively a reduced key may be used to maintain a required degree of security.

The inherent security provided by the use of elliptic curves is derived from the characteristic that an addition of two points on the curve can be defined as a further point that itself lies on the curve. Likewise the result of the addition of a point to itself will result in another point on the curve. Therefore, by selecting a starting point on the curve and multiplying it by an integer, a new point is obtained that lies on the curve. This means that where P=(x,y) is a point on an elliptic curve over a finite field [E(F_(q^n))], with x and y each represented by a vector of n elements, then for any other point R∈<P> (the subgroup generated by P) there is an integer d such that dP=R. To attack such a scheme, the task is to determine an efficient method to find an integer d, 0≤d≤(order of P)−1, such that dP=R. To break such a scheme, the best algorithms known to date have running times no better than O(√p), where p is the largest prime dividing the order of the curve (the number of points on the curve).

Thus, in a cryptographic system where the integer d remains secret, the difficulty of determining d can be exploited.

An ElGamal protocol of key exchange based on elliptic curves takes advantage of this characteristic in its definition of private and public keys. Such an ElGamal protocol operates as follows:

1. In order to set up the protocol, where a message is to be sent from A to B, an elliptic curve must be selected and a point P=(x,y), known as the generating point, must be selected.

Encryption

2. The receiver, B, then picks a random integer d as his private key. He then computes dP, which is another point on the curve, which becomes his public key that is made available to the sender and the public. Although the sender knows the value dP, due to the characteristic of elliptic curves noted above, he has great difficulty determining the private key d.

3. The sender A, chooses another random integer k, the session seed, and computes another point on the curve, kP which serves as a public session key.

This also exploits the characteristic of elliptic curves mentioned above.

4. The sender, A, then retrieves the public key dP of receiver B and computes kdP, another point on the curve, which serves as the shared encryption key for that session.

5. The sender, A, then encrypts the message M with the encryption key to obtain the ciphertext C.

6. The sender then sends the public session key kP and the ciphertext C to the receiver B.

Decryption

7. The receiver, B, determines the encryption key kdP by multiplying his private key d by kP.

8. The receiver, B, can then retrieve the message M by decrypting the ciphertext C with the encryption key kdP.

During the entire exchange, the private key d and the seed key k remain secret so that even if an interloper intercepts the session key kP he cannot derive the encryption key kdP from B's public key dP.
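The steps above may be sketched end to end on a deliberately tiny curve. Everything numeric here is an assumption made for illustration: the field F_(2^5) defined by X⁵+X²+1, the curve y²+xy=x³+x²+1, the fixed values d=3 and k=5 (random in practice), and the XOR of the message with the concatenated key coordinates. Point addition uses the usual characteristic 2 formulas for a non-supersingular curve.

```python
M, POLY = 5, 0b100101        # toy field F_(2^5), X^5 + X^2 + 1 (assumed)
A, B = 1, 1                  # hypothetical curve y^2 + xy = x^3 + A*x^2 + B
O = None                     # point at infinity

def gf_mul(u, v):
    """Field multiplication, polynomial basis (bit i = coefficient of X^i)."""
    r = 0
    while v:
        if v & 1:
            r ^= u
        v >>= 1
        u <<= 1
        if u >> M:
            u ^= POLY
    return r

def gf_inv(u):
    """Inverse as u^(2^M - 2)."""
    r = 1
    for _ in range(2**M - 2):
        r = gf_mul(r, u)
    return r

def on_curve(P):
    if P is O:
        return True
    x, y = P
    xx = gf_mul(x, x)
    return gf_mul(y, y) ^ gf_mul(x, y) == gf_mul(xx, x) ^ gf_mul(A, xx) ^ B

def add(P, Q):
    if P is O:
        return Q
    if Q is O:
        return P
    (x1, y1), (x2, y2) = P, Q
    if x1 == x2 and y2 == y1 ^ x1:               # Q = -P
        return O
    if P == Q:                                   # doubling (x1 != 0 here)
        xx = gf_mul(x1, x1)
        x3 = xx ^ gf_mul(B, gf_inv(xx))
        y3 = xx ^ gf_mul(x1 ^ gf_mul(y1, gf_inv(x1)), x3) ^ x3
    else:
        lam = gf_mul(y1 ^ y2, gf_inv(x1 ^ x2))
        x3 = gf_mul(lam, lam) ^ lam ^ x1 ^ x2 ^ A
        y3 = gf_mul(lam, x1 ^ x3) ^ x3 ^ y1
    return (x3, y3)

def scalar_mul(k, P):
    R = O
    while k:
        if k & 1:
            R = add(R, P)
        P = add(P, P)
        k >>= 1
    return R

def order(P):
    n, R = 1, P
    while R is not O:
        R, n = add(R, P), n + 1
    return n

# Step 1: pick a generating point (here, any point of maximal order).
points = [(x, y) for x in range(1 << M) for y in range(1 << M) if on_curve((x, y))]
P = max(points, key=order)

d = 3                                   # step 2: B's private key
dP = scalar_mul(d, P)                   #         B's public key
k = 5                                   # step 3: A's session seed
kP = scalar_mul(k, P)                   #         public session key
x0, y0 = scalar_mul(k, dP)              # step 4: A computes kdP
message = 0b1011001110                  # 2*M-bit message
C = message ^ ((x0 << M) | y0)          # step 5: encrypt (XOR, assumed)
xr, yr = scalar_mul(d, kP)              # step 7: B computes d(kP)
recovered = C ^ ((xr << M) | yr)        # step 8: decrypt
```

Only kP and C cross the channel; both parties arrive at the same point kdP without either d or k being disclosed.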

Elliptic curve cryptosystems can thus be implemented employing public and private keys and using the ElGamal protocol.

The elliptic curve cryptography method has a number of benefits. First, each person can define his own elliptic curve for encryption and decryption, which gives rise to increased security. If the private key security is compromised, the elliptic curve can be easily redefined and new public and private keys can be generated to return to a secure system. In addition, to decrypt data encoded with the method, only the parameters for the elliptic curve and the session key need be transmitted.

One of the drawbacks of other public key systems is the large bandwidth and storage requirements for the public keys. The implementation of a public key system using elliptic curves reduces the bandwidth and storage requirements of the public key system because the parameters can be stored in fewer bits. Until now, however, such a scheme was considered impractical due to the computational difficulties involved and the requirement for high speed calculations. The computation of kP, dP and kdP used in a key exchange protocol require complex calculations due to the mathematics involved in adding points in elliptic curve fields.

Computations on an elliptic curve are performed according to a well known set of relationships. If K defines any field, then an equation of the form y²+a₁xy+a₃y=x³+a₂x²+a₄x+a₆, where each of the coefficients a_(i) lie in K, defines an elliptic curve over K. If E is the set of points on this curve, then an abelian group can be defined on the set E∪{0}, where 0 is a special element not occurring in E. 0 acts as the zero element of the group. If P=(x,y), then −P=(x,−y) in the case of an odd characteristic, and for two points P and Q on the curve where Q≠±P, the sum P+Q is the third point on the curve where the line joining P and Q again meets the curve. If P=Q, then the tangent line is used. As in any abelian group, we use the notation nP to denote P added to itself n times if n is positive, and −P added to itself |n| times if n is negative, and 0P=0.

If F_(q) is a finite field, then elliptic curves over F_(q) can be divided into two classes, namely supersingular and non-supersingular curves. If F_(q) is of characteristic 2, i.e. q=2^(m), then the classes are defined as follows.

i) The set of all solutions to the equation y²+ay=x³+bx+c where a,b,c∈F_(q), a≠0, together with a special point called the point at infinity 0 is a supersingular curve over F_(q).

ii) The set of all solutions to the equation y²+xy=x³+ax²+b where a,b∈F_(q), b≠0, together with a special point called the point at infinity 0 is a non-supersingular curve over F_(q).

By defining an appropriate addition on these points, we obtain an additive abelian group. The addition of two points P(x₁, y₁) and Q(x₂, y₂) for the supersingular elliptic curve E with y²+ay=x³+bx+c is given by the following:

If P=(x₁, y₁)∈E, then define −P=(x₁, y₁+a), and P+0=0+P=P for all P∈E.

If Q=(x₂, y₂)∈E and Q≠−P, then the point representing the sum P+Q is denoted (x₃, y₃), where

$x_{3} = \begin{cases} \left( \frac{y_{1} \oplus y_{2}}{x_{1} \oplus x_{2}} \right)^{2} \oplus x_{1} \oplus x_{2} & (P \neq Q) \\ \frac{x_{1}^{4} \oplus b^{2}}{a^{2}} & (P = Q) \end{cases}$

and

$y_{3} = \begin{cases} \left( \frac{y_{1} \oplus y_{2}}{x_{1} \oplus x_{2}} \right)\left( x_{1} \oplus x_{3} \right) \oplus y_{1} \oplus a & (P \neq Q) \\ \left( \frac{x_{1}^{2} \oplus b}{a} \right)\left( x_{1} \oplus x_{3} \right) \oplus y_{1} \oplus a & (P = Q) \end{cases}$

The addition of two points P(x₁, y₁) and Q(x₂, y₂) for the non-supersingular elliptic curve y²+xy=x³+ax²+b is given by the following:

If P=(x₁, y₁)∈E then define −P=(x₁, y₁+x₁). For all P∈E, 0+P=P+0=P. If Q=(x₂, y₂)∈E and Q≠−P, then P+Q is a point (x₃, y₃) where

$x_{3} = \begin{cases} \left( \frac{y_{1} \oplus y_{2}}{x_{1} \oplus x_{2}} \right)^{2} \oplus \frac{y_{1} \oplus y_{2}}{x_{1} \oplus x_{2}} \oplus x_{1} \oplus x_{2} \oplus a & (P \neq Q) \\ x_{1}^{2} \oplus \frac{b}{x_{1}^{2}} & (P = Q) \end{cases}$

and

$y_{3} = \begin{cases} \left( \frac{y_{1} \oplus y_{2}}{x_{1} \oplus x_{2}} \right)\left( x_{1} \oplus x_{3} \right) \oplus x_{3} \oplus y_{1} & (P \neq Q) \\ x_{1}^{2} \oplus \left( x_{1} \oplus \frac{y_{1}}{x_{1}} \right)x_{3} \oplus x_{3} & (P = Q) \end{cases}$

Accordingly it can be seen that computing the sum of two points on E requires several multiplications, additions, and inverses in the underlying field F_(q). In turn, each of these operations requires a sequence of elementary bit operations.

When implementing an ElGamal or Diffie-Hellman scheme with elliptic curves, one is required to compute kP=P+P+. . .+P (P added k times) where k is a positive integer and P∈E. Computed directly, this requires (x₃, y₃) to be computed k−1 times. Even if alternative techniques such as “double and add” are utilised, it is still necessary to compute the addition of two points several times, each of which requires multiplications, additions and inverses in the underlying finite field. For large values of k which are typically necessary in cryptographic applications, this has previously been considered impractical for data communication.
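The saving offered by “double and add” over k−1 successive additions may be sketched with a generic routine. To keep the sketch short, the group is an assumed stand-in (the integers modulo 1009 under addition) rather than an elliptic curve; the function and parameter names are hypothetical, and any point addition routine could be supplied as `add`.

```python
def double_and_add(k, P, add, zero):
    """Return (kP, number of group operations) by scanning the bits of k."""
    result, addend, ops = zero, P, 0
    while k:
        if k & 1:                       # this bit of k contributes the current addend
            result = add(result, addend)
            ops += 1
        k >>= 1
        if k:                           # double only while bits remain
            addend = add(addend, addend)
            ops += 1
    return result, ops

# Stand-in group (assumption): integers modulo 1009 under addition.
n = 1009
add_mod = lambda a, b: (a + b) % n

k, P = 91, 7                            # 91 = 0b1011011
kP, ops = double_and_add(k, P, add_mod, 0)
```

For k=91 the routine uses 11 group operations (6 doublings and 5 additions) in place of the 90 additions of the direct method; each such operation on a curve still carries the field multiplications and inversion noted above.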

BRIEF SUMMARY OF THE INVENTION

It is an object of the present invention to provide a method of encryption utilizing elliptic curves that facilitates the computation of additions of points while providing an adequate level of security in an efficient and effective manner.

The applicants have developed a method using a modified version of the Diffie-Hellman and ElGamal protocols defined in the group associated with the points on an elliptic curve over a finite field. The method involves formulating the elliptic curve calculations so as to make elliptic curve cryptography efficient, practical and viable, and preferably employs a finite field processor such as the Computational Method and Apparatus for Finite Field Multiplication as disclosed in U.S. Pat. No. 4,745,568. The preferred method exploits the strengths of such a processor with its computational abilities in finite fields. The inventive method structures the elliptic curve calculations as finite field multiplication and exponentiation over the field F_(2^m). In the preferred method, a normal basis representation of the finite field is selected so that the calculations can readily be performed on a finite field processor.

The inventors have recognized that the computations necessary to implement the elliptic curve calculations can be performed efficiently where a finite field of characteristic 2 is chosen.

When computing in a field of characteristic 2, i.e. F_(2^m), squaring is a linear operation, i.e. (A+B)²=A²+B². By adopting appropriate representations, the computation of the squared terms required in the addition of two points is greatly simplified. In particular, if a normal basis representation is chosen, squaring can be achieved through a cyclic shift of the binary vector representing the field element to be squared.

Moreover, computing inverses in F_(2^m) can be implemented with simple shift and XOR operations by selection of an appropriate representation. In some implementations, the computation of an inverse can be arranged to utilize multiple squaring operations and thereby improve the efficiency of the computation.

When such computations are performed using a normal basis representation of the finite field, the inventors have also recognized that the elliptic curve calculations are further simplified. With the computations presented in this form, the applicants have realized that specialized semiconductor devices can be fabricated to perform the calculations, and additions in the field F_(2^m) can be efficiently performed in one clock cycle utilizing a simple XOR operation.

Multiplications can be performed very efficiently in only n clock cycles where n is the number of bits being multiplied. Furthermore, squaring can be efficiently performed in 1 clock cycle as a cyclic shift of the bit register. Finally, inverses can easily be computed, requiring approximately log₂n multiplications rather than the approximately 2n multiplications required in other arithmetic systems.
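One way in which an inverse may be obtained with a number of multiplications growing only as log₂m is an addition chain of the Itoh-Tsujii type, sketched below. This is an assumption about the kind of arrangement contemplated, not a statement of the processor's actual sequence. The field F_(2^7) defined by X⁷+X+1 and the function names are likewise assumed; in a normal basis each squaring in the chain would be a cyclic shift rather than a multiplication.

```python
M, POLY = 7, 0b10000011      # toy field F_(2^7), X^7 + X + 1 (assumed)

def gf_mul(u, v):
    """Field multiplication, polynomial basis (bit i = coefficient of X^i)."""
    r = 0
    while v:
        if v & 1:
            r ^= u
        v >>= 1
        u <<= 1
        if u >> M:
            u ^= POLY
    return r

def gf_sq(u):
    return gf_mul(u, u)      # a cyclic shift in a normal basis; not counted below

def inverse(a):
    """Return (a^-1, general multiplications used), with a^-1 = a^(2^M - 2)."""
    beta, k, mults = a, 1, 0             # invariant: beta = a^(2^k - 1)
    for bit in bin(M - 1)[3:]:           # bits of M-1 after the leading one
        t = beta
        for _ in range(k):
            t = gf_sq(t)                 # t = beta^(2^k)
        beta, k, mults = gf_mul(t, beta), 2 * k, mults + 1
        if bit == "1":
            beta, k, mults = gf_mul(gf_sq(beta), a), k + 1, mults + 1
    return gf_sq(beta), mults            # (a^(2^(M-1) - 1))^2

inv5, mults5 = inverse(5)
```

For m=7 the chain uses 3 general multiplications; the same chain applied to m=155 would use 10, the remainder of the work being squarings.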

The inventors have also recognized that the bandwidth and storage requirements of a cryptographic system utilizing elliptic curves can be significantly reduced where for any point P(x,y) on the curve, only the x coordinate and one bit of the y coordinate need be stored and transmitted, since the one bit will indicate which of the two possible solutions is the second coordinate.

The inventors have also recognized when using the ElGamal protocol that messages need not be points on the curve if the protocol is modified such that the message M is considered as a pair of field elements (m₁, m₂) and each is operated on by the coordinates (x, y) of the session encryption key kdP in a predetermined manner to produce new field elements (c₁, c₂) that represent the ciphertext C. The receiver can then extract the message M=(m₁, m₂) by applying the inverse of the predetermined transformation. Although this may require an inverse operation in the field, such operations may be performed efficiently in the field F_(2^155), and in particular when operating with the processor noted above.

To assist in the appreciation of the implementation of the present invention, it is believed that a review of the underlying principles of finite field operations is appropriate. The finite field F₂ is the number system in which the only elements are the binary numbers 0 and 1 and in which the rules of addition and multiplication are the following:

0+0=1+1=0

0+1=1+0=1

0×0=1×0=0×1=0

1×1=1

These rules are commonly called modulo-2 arithmetic. All additions specified in logic expressions or by adders in this application are performed modulo 2 as an XOR operation. Furthermore, multiplication is implemented with logical AND gates.

The finite field F_(2^m), where m is an integer greater than 1, is the number system in which there are 2^(m) elements and in which the rules of addition and multiplication correspond to arithmetic modulo an irreducible polynomial of degree m with coefficients in F₂. Although in an abstract sense there is for each m only one field F_(2^m), the complexity of the logic required to perform operations in F_(2^m) depends strongly on the particular way in which the field elements are represented. These operations may be performed using processors implemented in either hardware or software, with dedicated hardware processors generally considered faster.

The conventional approach to operations performed in F_(2^m) is described in such papers as T. Bartee and D. Schneider, “Computation with Finite Fields”, Information and Control, Vol. 6, pp. 79-98, 1963. In this conventional approach, one first chooses a polynomial P(X) of degree m which is irreducible over F₂, that is, P(X) has binary coefficients but cannot be factored into a product of polynomials with binary coefficients each of whose degree is less than m. An element A in F_(2^m) is then defined to be a root of P(X), that is, to satisfy P(A)=0. The fact that P(X) is irreducible guarantees that the m elements A⁰=1, A, A², . . . A^(m−1) of F_(2^m) are linearly independent over F₂.

For the purposes of illustration, the example of F_(2^3) will be used with the choice of P(X)=X³+X+1 for the irreducible polynomial of degree 3. The next step is to define A as an element of F_(2^3) such that A³+A+1=0. The following assignment of unit vectors is then made:

A ⁰=1=[1,0,0]

A ¹=[0,1,0]

A ²=[0,0,1]

An arbitrary element B of F_(2^3) is now represented by the binary vector [b₂, b₁, b₀] with the meaning that B=[b₂, b₁, b₀]=b₂A²+b₁A+b₀.

If we represent a second element C=[c₂, c₁, c₀], it follows that B+C=[b₂⊕c₂, b₁⊕c₁, b₀⊕c₀].

Thus, in the conventional approach, addition in F_(2^3) is easily performed by logic that merely forms the modulo-2 sum of the two vectors representing the elements to be summed component-by-component. Multiplication is, however, considerably more complex to implement.
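The contrast between addition and multiplication in the conventional representation may be sketched for F_(2^3) with P(X)=X³+X+1. In this sketch an element is held as an integer whose bit i is the coefficient of A^i, which is an assumed encoding chosen for convenience; the function names are hypothetical.

```python
P_X = 0b1011                     # P(X) = X^3 + X + 1

def add(b, c):
    """Component-by-component modulo-2 sum: a single XOR."""
    return b ^ c

def multiply(b, c):
    """Polynomial product followed by reduction modulo P(X)."""
    prod = 0
    for i in range(3):
        if (c >> i) & 1:
            prod ^= b << i       # carry-less partial products
    for deg in (4, 3):           # fold terms of degree 4, then 3, back into the field
        if (prod >> deg) & 1:
            prod ^= P_X << (deg - 3)
    return prod

A = 0b010                        # the root A of P(X)
A2 = multiply(A, A)
A3 = multiply(A2, A)             # reduces to A + 1
A4 = multiply(A3, A)             # reduces to A^2 + A
```

Addition is one XOR, whereas multiplication requires both the partial products and a reduction step that depends on the chosen polynomial.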

Continuing the example, from the irreducible polynomial it can be seen that A³=A+1 and A⁴=A²+A, where use has been made of the fact that −1=+1 in F₂. In hardware, multiplication can be simplified by taking advantage of the special feature of a finite field F_(2^m) that there always exists a so-called normal basis for the finite field. That is, one can always find a field element N such that N, N², N⁴, . . ., N^(2^(m−1)) are a basis for F_(2^m). Every field element B can be uniquely written as B=b_(m−1)N^(2^(m−1))+. . .+b₂N⁴+b₁N²+b₀N=[b_(m−1), . . . , b₂, b₁, b₀] where b₀, b₁, b₂, . . . b_(m−1) are binary digits.

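For example, in the finite field F_(2^3), if we let N=[1,1,0], then the field elements have the following normal basis representations, where each normal basis vector lists the coefficients of N, N² and N⁴ in that order:

Field Element    Normal Basis Representation    Normal Basis Vector
[0,0,0]          0                              [0,0,0]
[1,0,0]          N + N² + N⁴                    [1,1,1]
[0,1,0]          N² + N⁴                        [0,1,1]
[0,0,1]          N + N⁴                         [1,0,1]
[1,1,0]          N                              [1,0,0]
[1,0,1]          N²                             [0,1,0]
[0,1,1]          N + N²                         [1,1,0]
[1,1,1]          N⁴                             [0,0,1]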

Then, if B=[b_(m−1), . . . , b₂, b₁, b₀] and C=[c_(m−1), . . . , c₂, c₁, c₀] are any two elements of F_(2^m) in normal basis representation, the product D=B×C=[d_(m−1), . . . , d₂, d₁, d₀] has the property that the same logic circuitry which, when applied to the components or binary digits of the vectors representing B and C, produces d_(m−1) will sequentially produce the remaining components d_(m−2), . . . , d₂, d₁, d₀ of the product when applied to the components of the successive shifts of the vectors representing B and C.

As illustrated in U.S. Pat. No. 4,745,568 for Computational Method and Apparatus for Finite Field Multiplication, multiplication may be implemented by storing bit vectors B and C in respective shift registers and establishing connections to respective accumulating cells such that a grouped term of each of the expressions d_(i) is generated in respective ones of m accumulating cells. By rotating the bit vectors B and C in the shift registers and by rotating the contents of the accumulating cells, each grouped term of a respective binary digit d_(i) is accumulated in successive cells. Thus all of the binary digits of the product vector are generated simultaneously in the accumulating cells after one complete rotation of the bit vectors B and C.

One attribute of operating such a processor in the field F_(2^m) is that squaring is a linear operation in the sense that for every pair of elements B and C in F_(2^m), (B+C)²=B²+C². It is also the case for every element B of F_(2^m) that B^(2^m)=B.

In particular, in a normal basis representation, squaring an element involves a cyclic shift of the vector representation of the element, i.e. if B=[b_(m−1), . . . , b₂, b₁, b₀] then B²=[b_(m−2), . . . , b₂, b₁, b₀, b_(m−1)].
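The cyclic shift property may be checked exhaustively in F_(2^3) with P(X)=X³+X+1 and the normal element N=1+A. The sketch below assumes an integer encoding in which bit i is the coefficient of A^i, and finds normal basis coordinates by search, which is practical only for a field this small; the function names are hypothetical.

```python
P_X = 0b1011                              # P(X) = X^3 + X + 1

def multiply(b, c):
    """Polynomial basis product in F_(2^3), reduced modulo P(X)."""
    prod = 0
    for i in range(3):
        if (c >> i) & 1:
            prod ^= b << i
    for deg in (4, 3):
        if (prod >> deg) & 1:
            prod ^= P_X << (deg - 3)
    return prod

N = 0b011                                 # N = 1 + A
N2 = multiply(N, N)
N4 = multiply(N2, N2)
basis = [N, N2, N4]                       # normal basis N, N^2, N^4

def to_normal(elem):
    """Return (b0, b1, b2) with elem = b0*N + b1*N^2 + b2*N^4."""
    for bits in range(8):
        coords = tuple((bits >> i) & 1 for i in range(3))
        value = 0
        for coeff, vec in zip(coords, basis):
            if coeff:
                value ^= vec
        if value == elem:
            return coords

def cyclic_shift(coords):
    """Squaring moves the coefficient of N^(2^i) to N^(2^(i+1)); N^8 = N wraps around."""
    b0, b1, b2 = coords
    return (b2, b0, b1)

squares_are_shifts = all(
    to_normal(multiply(e, e)) == cyclic_shift(to_normal(e)) for e in range(8)
)
```

No multiplication is needed to square an element once it is held in normal basis coordinates; the search in `to_normal` serves only to convert from the polynomial encoding used by this sketch.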

Thus when using the processor exemplified above, squaring may be achieved in one cycle. Moreover, this general characteristic of F_(2^m), where squaring is a linear operation, may be exploited in other implementations, such as software, where a normal basis representation is not used.

As noted above, the inventors have taken advantage of the efficiency of the mathematical operations in F_(2^m) in the implementation of an elliptic curve encryption scheme. The applicants have developed a method of formulating the elliptic curve calculations so as to make elliptic curve cryptography efficient, practical and viable. The preferred method employs a finite field processor such as the Computational Method and Apparatus for Finite Field Multiplication as disclosed in U.S. Pat. No. 4,745,568. The method couples the attractive cryptographic characteristics of elliptic curves with the strengths of the field processor through its computational abilities in the finite field F_(2^m). The inventive method structures the elliptic curve calculations as operations, such as multiplication and exponentiation, over the field F_(2^m), which can readily be calculated on a finite field processor.

BRIEF DESCRIPTION OF THE DRAWINGS

An embodiment of the invention will now be described by way of example only with reference to the accompanying drawings in which:

FIG. 1 is a diagram of the transmission of an encrypted message from one location to another,

FIG. 2 is a diagram of an encryption module used with the communication system of FIG. 1,

FIG. 3 is a diagram of a finite field processor used in the encryption and decryption module of FIG. 2.

FIG. 4 is a flow chart showing movement of the elements through the processor of FIG. 3 in computing an inverse function.

FIG. 5 is a flow chart showing movement of elements through the processor of FIG. 3 to compute the addition of two points.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

An embodiment of the invention will first be described utilising an ElGamal key exchange protocol and a Galois field F_(2^155) to explain the underlying principles. Further refinements will then be described.

System Components

Referring therefore to FIG. 1, a message M is to be transferred from a transmitter 10 to a receiver 12 through a communication channel 14. Each of the transmitter 10 and receiver 12 has an encryption/decryption module 16 associated therewith to implement a key exchange protocol and an encryption/decryption algorithm.

The module 16 is shown schematically in FIG. 2 and includes an arithmetic unit 20 to perform the computations in the key exchange and generation. A private key register 22 contains a private key, d, generated as a 155 bit data string from a random number generator 24, and used to generate a public key stored in a public key register 26. A base point register 28 contains the coordinates of a base point P that lies in the elliptic curve selected with each coordinate (x, y), represented as a 155 bit data string. Each of the data strings is a vector of binary digits with each digit being the coefficient of an element of the finite field in the normal basis representation of the coordinate.

The elliptic curve selected will have the general form y²+xy=x³+ax²+b and the parameters of that curve, namely the coefficients a and b are stored in a parameter register 30. The contents of registers 22, 24, 26, 28, 30 may be transferred to the arithmetic unit 20 under control of a C.P.U. 32 as required.

The contents of the public key register 26 are also available to the communication channel 14 upon a suitable request being received. In the simplest implementation, each encryption module 16 in a common security zone will operate with the same curve and base point so that the contents of registers 28 and 30 need not be accessible. If further sophistication is required, however, each module 16 may select its own curve and base point in which case the contents of registers 28, 30 have to be accessible to the channel 14.

The module 16 also contains an integer register 34 that receives an integer k, the session seed, from the generator 24 for use in encryption and key exchange. The module 16 has a random access memory (RAM) 36 that is used as a temporary store as required during computations.

The encryption of the message M with an encryption key kdP derived from the public key dP and session seed integer k is performed in an encryption unit 40 which implements a selected encryption algorithm. A simple yet effective algorithm is provided by an XOR function which XOR's the message m with the 310 bits of the encryption key kdP. Alternative implementations such as the DES encryption algorithm could of course be used.

An alternative encryption protocol treats the message m as a pair of coordinates m₁, m₂, each of 155 bit length in the case of F_(2^155), and XOR's the message m₁, m₂ with the coordinates of the session key kdP to provide a pair of bit strings (m₁⊕x₀), (m₂⊕y₀). For further security a pair of field elements z₁, z₂ is also formed from the coordinates (x₀, y₀) of kdP.

In one embodiment, the elements z₁z₂ are formed from the concatenation of part of x₀ with part of y₀, for example, z₁=x₀₁∥y₀₂ and z₂=x₀₂∥y₀₁ where x₀₁ is the first half of the bit string of x₀

x₀₂ is the second half of the bit string of x₀

y₀₁ is the first half of the bit string of y₀

y₀₂ is the second half of the bit string of y₀

The elements z₁ and z₂, when treated as field elements, are then multiplied with the respective bit strings (m₁⊕x₀) and (m₂⊕y₀) to provide bit strings c₁, c₂ of ciphertext c.

i.e. c₁=z₁ (m₁⊕x₀)

c₂=z₂ (m₂⊕y₀)
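This embodiment may be sketched in a small field. The field F_(2^8) defined by X⁸+X⁴+X³+X+1, the sample coordinates and message, and the convention that the “first half” of a bit string is its high-order half are all assumptions made for illustration. The decryption routine shown is simply the inverse of the two equations above and requires z₁ and z₂ to be non-zero.

```python
M, POLY = 8, 0x11B               # toy field F_(2^8), X^8+X^4+X^3+X+1 (assumed)
HALF = M // 2

def gf_mul(u, v):
    """Field multiplication, polynomial basis (bit i = coefficient of X^i)."""
    r = 0
    while v:
        if v & 1:
            r ^= u
        v >>= 1
        u <<= 1
        if u >> M:
            u ^= POLY
    return r

def gf_inv(u):
    """Inverse as u^(2^M - 2)."""
    r = 1
    for _ in range(2**M - 2):
        r = gf_mul(r, u)
    return r

def halves(v):
    """Split an M-bit string into (first half, second half); first = high-order bits."""
    return v >> HALF, v & ((1 << HALF) - 1)

def derive_z(x0, y0):
    x01, x02 = halves(x0)
    y01, y02 = halves(y0)
    return (x01 << HALF) | y02, (x02 << HALF) | y01    # z1 = x01||y02, z2 = x02||y01

def encrypt(m1, m2, x0, y0):
    z1, z2 = derive_z(x0, y0)
    return gf_mul(z1, m1 ^ x0), gf_mul(z2, m2 ^ y0)    # c1, c2

def decrypt(c1, c2, x0, y0):
    z1, z2 = derive_z(x0, y0)                          # both must be non-zero
    return gf_mul(c1, gf_inv(z1)) ^ x0, gf_mul(c2, gf_inv(z2)) ^ y0

x0, y0 = 0xA7, 0x3C              # hypothetical coordinates of kdP
m1, m2 = 0x48, 0x69              # hypothetical message halves
c1, c2 = encrypt(m1, m2, x0, y0)
```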

In a preferred implementation of the encryption protocol, a function of x₀ is used in place of y₀ in the above embodiment. For example, the function x₀³ is used as the second 155 bit string so that

c₁=z₁ (m₁⊕x₀)

c₂=z₂ (m₂⊕x₀ ³)

and

z₁=x₀₁∥x³₀₂

z₂=x₀₂∥x³₀₁

where x³₀₁ is the first half of x₀³

x³₀₂ is the second half of x₀³

This protocol is also applicable to implementation of elliptic curve encryption in a field other than F_(2^m), for example Z_(p) or in general F_(p^m).

Where Z_(p) is used it may be necessary to adjust the values of x₀ and y₀ or x₀³ to avoid overflow in the multiplication with z₁ and z₂. Conventionally this may be done by setting the most significant bit of x₀ and y₀ or x₀³ to zero.

Key Generation, Exchange and Encryption

In order for the transmitter 10 to send the message M to the receiver 12, the receiver's public key is retrieved by the transmitter 10. The public key is obtained by the receiver 12 computing the product of the secret key d and base point P in the arithmetic unit 20 as will be described more fully below. The product dP represents a point on the selected curve and serves as the public key. The public key dP is stored as two 155 bit data strings in the public key register 26.

Upon retrieval of the public key dP by the transmitter 10, it is stored in the RAM 36. It will be appreciated that even though the base point P is known and publicly available, the attributes of the elliptic curve inhibit the extraction of the secret key d.

The transmitter 10 uses the arithmetic unit 20 to compute the product of the session seed k and the public key dP and stores the result, kdP, in the RAM 36 for use in the encryption algorithm. The result kdP is a further point on the selected curve, again represented by two 155 bit data strings or vectors, and serves as an encryption key.

The transmitter 10 also computes the product of the session seed k with the base point P to provide a new point kP, the session public key, which is stored in the RAM 36.

The transmitter 10 now has the public key dP of the receiver 12, a session public key kP and an encryption key kdP and may use these to send an encrypted message. The transmitter 10 encrypts the message M with the encryption key kdP in the encryption unit 40 implementing the selected encryption protocols discussed above to provide an encrypted message C. The ciphertext C is transmitted together with the value kP to the encryption module 16 associated with receiver 12.

The receiver 12 utilises the session public key kP with its private key d to compute the encryption key kdP in the arithmetic unit 20 and then decrypt the ciphertext C in the encryption unit 40 to retrieve the message M.

During this exchange, the secret key d and the session seed k remain secret and secure. Although P, kP and dP are known, the encryption key kdP cannot be computed due to the difficulty in obtaining either d or k.
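The exchange described above can be sketched in a few lines. The following is a minimal illustration only, under stated assumptions: a toy field F_(2^7) held in a polynomial basis (not the 155 bit normal basis of the arithmetic unit 20), an arbitrarily chosen curve y²+xy=x³+x²+1, and hypothetical helper names (`gf_mul`, `gf_inv`, `point_add`, `scalar_mul`). It shows only that the transmitter's k(dP) and the receiver's d(kP) agree.

```python
# Toy parameters (assumptions): F_(2^7) in a polynomial basis, curve y^2 + xy = x^3 + x^2 + 1.
M, POLY = 7, 0b10000011          # reduction polynomial x^7 + x + 1
A, B = 1, 1

def gf_mul(a, b):
    """Multiply two field elements held as bit masks, reducing modulo POLY."""
    r = 0
    while b:
        if b & 1:
            r ^= a
        b >>= 1
        a <<= 1
        if (a >> M) & 1:
            a ^= POLY
    return r

def gf_inv(a):
    """Return a^(2^M - 2), the inverse of a nonzero element."""
    r = 1
    for _ in range(M - 1):
        a = gf_mul(a, a)
        r = gf_mul(r, a)
    return r

def on_curve(pt):
    if pt is None:               # None stands for the point at infinity
        return True
    x, y = pt
    x2 = gf_mul(x, x)
    return (gf_mul(y, y) ^ gf_mul(x, y)) == (gf_mul(x2, x) ^ gf_mul(A, x2) ^ B)

def point_add(p, q):
    """Affine addition on the nonsupersingular curve."""
    if p is None:
        return q
    if q is None:
        return p
    (x1, y1), (x2, y2) = p, q
    if x1 == x2:
        if (y1 ^ y2) == x1:      # q = -p (this also covers the point with x = 0)
            return None
        x1sq = gf_mul(x1, x1)    # doubling, p = q
        x3 = x1sq ^ gf_mul(B, gf_inv(x1sq))
        y3 = x1sq ^ gf_mul(x1 ^ gf_mul(y1, gf_inv(x1)), x3) ^ x3
        return (x3, y3)
    lam = gf_mul(y1 ^ y2, gf_inv(x1 ^ x2))
    x3 = gf_mul(lam, lam) ^ lam ^ x1 ^ x2 ^ A
    y3 = gf_mul(lam, x1 ^ x3) ^ x3 ^ y1
    return (x3, y3)

def scalar_mul(n, pt):
    """Compute n*pt by repeated doubling and addition."""
    result = None
    while n:
        if n & 1:
            result = point_add(result, pt)
        pt = point_add(pt, pt)
        n >>= 1
    return result

# Base point P: the first affine point with x != 0 found by brute force.
P = next((x, y) for x in range(1, 1 << M) for y in range(1 << M) if on_curve((x, y)))

d, k = 13, 29                    # receiver's secret key and transmitter's session seed
dP = scalar_mul(d, P)            # receiver's public key
kP = scalar_mul(k, P)            # session public key sent with the ciphertext
key_tx = scalar_mul(k, dP)       # encryption key computed by the transmitter
key_rx = scalar_mul(d, kP)       # the same key recomputed by the receiver
```

In this sketch `key_tx` and `key_rx` are equal because both are (k·d)P; the toy field is far too small to offer any security.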

The efficacy of the encryption depends upon the efficient computation of the values kP, dP and kdP by the arithmetic unit 20. Each computation requires the repetitive addition of two points on the curve, which in turn requires the computation of squares and inverses in F_(2^m).

Operation of the Arithmetic Unit

The operation of the arithmetic unit 20 is shown schematically in FIG. 3. The unit 20 includes a multiplier 48 having a pair of cyclic shift registers 42, 44 and an accumulating register 46. Each of the registers 42, 44, 46 contains m cells 50a, 50b . . . 50m, in this example 155, to receive the m elements of a normal basis representation of one of the coordinates, e.g. x, of P. As fully explained in U.S. Pat. No. 4,745,568, the cells 50 of registers 42, 44 are connected to the corresponding cells 50 of accumulating register 46 in such a way that a respective grouped term is generated in each cell of register 46. The registers 42, 44, 46 are also directly interconnected in a bit wise fashion to allow fast transfers of data between the registers.

The movement of data through the registers is controlled by a control register 52 that can execute the instruction set shown in the table below:

TABLE 1
INSTRUCTION SET

Operation | Size | Clock Cycles
Field Multiplication: MULT | 155 bit blocks | 156
Calculation of Inverse: INVERSE | 24 multiplications | approx. 3800
I/O: WRITE (A, B or C), READ (A, B or C) | 5 × 32 bit transfers per read/write to registers | 10 clock cycles; 2 clock cycles per transfer
Elementary Register Operations: NOP (idle), ROTATE (A, B or C), COPY (A←B) (A←C) (A←B) (B←C), SWAP (A⇄B), CLEAR (A, B or C), SET (A, B or C), ADD (A•B), ACCUMULATE | 155 bit parallel operation |

The unit 20 includes an adder 54 to receive data from the registers 42,44,46 and RAM 36. The adder 54 is an XOR function and its output is a data stream that may be stored in RAM 36 or one of the registers 42, 44. Although shown as a serial device, it will be appreciated that it may be implemented as a parallel device to improve computing time. Similarly the registers 42,44,46 may be parallel loaded. Each of the registers 42,44,46, is a 155 bit register and is addressed by a 32 bit data bus to allow 32 bit data transfer in 2 clock cycles and the entire loading in 5 operations.

The subroutines used in the computation will now be described.

a) Multiplication

The cyclic shift of the elements through the registers 42, 44 m times with a corresponding shift of the accumulating register 46 accumulates successive group terms in respective accumulating cells and a complete rotation of the elements in the registers 42, 44, produces the elements of the product in the accumulating register 46.

b) Squaring

By operating in F_(2^m) and adopting a normal basis representation of the field elements, the multiplier 48 may also provide the square of a number by cyclically shifting the elements one cell along the register 42. After a one cell shift, the elements in the register represent the square of the number. In general, a number may be raised to the power 2^(g) by cyclically shifting g times through a register.
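The shift property can be checked numerically on a small field. The sketch below is illustrative only and rests on assumptions: the toy field F_(2^5) with reduction polynomial x⁵+x²+1, a normal basis found by brute-force search, and hypothetical helper names. It compares squaring computed by ordinary multiplication with a one cell cyclic shift of the normal basis coordinates.

```python
M, POLY = 5, 0b100101            # toy field F_(2^5), reduction polynomial x^5 + x^2 + 1

def gf_mul(a, b):
    """Multiply two elements held as polynomial-basis bit masks."""
    r = 0
    while b:
        if b & 1:
            r ^= a
        b >>= 1
        a <<= 1
        if (a >> M) & 1:
            a ^= POLY
    return r

def conjugates(beta):
    """Return [beta, beta^2, beta^4, ..., beta^(2^(M-1))]."""
    out = [beta]
    for _ in range(M - 1):
        out.append(gf_mul(out[-1], out[-1]))
    return out

def coordinate_table(basis):
    """Map each element to its coordinate word (bit i = coefficient of basis[i]).

    Returns None when the candidate vectors do not span the field.
    """
    table = {}
    for coords in range(1 << M):
        elem = 0
        for i in range(M):
            if (coords >> i) & 1:
                elem ^= basis[i]
        table[elem] = coords
    return table if len(table) == 1 << M else None

def find_normal_basis():
    """Search for beta whose conjugates form a basis; return the coordinate table."""
    for beta in range(1, 1 << M):
        table = coordinate_table(conjugates(beta))
        if table is not None:
            return table
    raise ValueError("no normal basis found")

def rotate(coords, n):
    """Cyclically move the coefficient in cell i to cell (i + n) mod M."""
    n %= M
    mask = (1 << M) - 1
    return ((coords << n) | (coords >> (M - n))) & mask

TO_NORMAL = find_normal_basis()

def power_2g(elem, g):
    """Raise elem to the power 2^g by g successive multiplications elem*elem."""
    for _ in range(g):
        elem = gf_mul(elem, elem)
    return elem
```

For every element e of this toy field, the normal basis coordinates of `power_2g(e, g)` equal `rotate(TO_NORMAL[e], g)`, which is the g cell shift described above.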

c) Inversion

Computation of the inverse of a number can be performed efficiently with the multiplier 48 by implementing an algorithm which utilises multiple squaring operations. The inverse X⁻¹ is represented as X^(2^m − 2) or X^(2(2^(m−1) − 1)).

If m−1 is considered as the product of two factors g, h then X⁻¹ may be written as X^(2(2^(gh) − 1)) or β^(2^(gh) − 1) where β=X².

The exponent 2^(gh) − 1 is equivalent to $\left( 2^{g} - 1 \right)\left( \sum\limits_{i = 0}^{h - 1}2^{ig} \right)$

The term 2^(g) − 1 may be written as $\sum\limits_{j = 0}^{g - 1}2^{j}$

so that $X^{-1} = \beta^{\left( \sum\limits_{j = 0}^{g - 1}2^{j} \right)\left( \sum\limits_{i = 0}^{h - 1}2^{ig} \right)}$

The term $\beta^{\sum\limits_{j = 0}^{g - 1}2^{j}} \equiv \beta^{1 + 2 + 2^{2} + 2^{3} + \ldots + 2^{g - 1}}$ is denoted γ.

This term may be computed on multiplier 48 as shown in FIG. 4 by initially loading register 42 with the value X. This is shifted 1 cell to represent β (i.e. X²) and the result loaded into both registers 42, 44.

Register 42 is then shifted to provide β² and the registers 42, 44 multiplied to provide β^(2+1) in the accumulating register 46. The multiplication is obtained with one motion, i.e. an m bit cyclic shift, of each of the registers 42, 44, 46.

The accumulated term β^(2+1) is transferred to register 44 and register 42, which contains β², is shifted one place to provide β⁴. The registers 42, 44 are multiplied to provide β^(1+2+4).

This procedure is repeated g−2 times to obtain γ. As will be described below, γ can be exponentiated in a similar manner to obtain $\gamma^{\sum\limits_{i = 0}^{h - 1}2^{ig}}$, i.e. X⁻¹.

This term can be expressed as γ^(1 + 2^g + 2^(2g) + 2^(3g) + . . . + 2^((h−1)g)).

As noted above, γ can be raised to the power 2^(g) by shifting the normal basis representation g times in the register 42 or 44.

Accordingly, the registers 42, 44 are each loaded with the value γ and the register 42 shifted g times to provide γ^(2^g). The registers 42, 44 are multiplied to provide γ·γ^(2^g) or γ^(1+2^g) in the accumulating register 46. This value is transferred to the register 44 and the register 42 shifted g times to provide γ^(2^(2g)).

The multiplication will then provide γ^(1+2^g+2^(2g)). Repetition of this procedure (h−1)g−1 times produces the inverse of X in the accumulating register 46.

From the above it will be seen that squaring, multiplying, and inverting can be effectively performed utilising the finite field multiplier 48.
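The inversion procedure can be followed step by step in software. The sketch below is a hedged illustration, not the hardware procedure itself: it assumes the toy field F_(2^7) in a polynomial basis, so that m−1=6 can be factored as g·h, and it performs each squaring by a multiplication where the arithmetic unit would use a cyclic shift. The function names are hypothetical.

```python
M, POLY = 7, 0b10000011          # toy field F_(2^7), reduction polynomial x^7 + x + 1

def gf_mul(a, b):
    """Multiply two elements held as polynomial-basis bit masks."""
    r = 0
    while b:
        if b & 1:
            r ^= a
        b >>= 1
        a <<= 1
        if (a >> M) & 1:
            a ^= POLY
    return r

def gf_sqr_n(a, n):
    """Square n times; stands in for an n cell cyclic shift in a normal basis."""
    for _ in range(n):
        a = gf_mul(a, a)
    return a

def gf_inverse(x, g, h):
    """Return (x^-1, number of general multiplications) for g*h == M - 1."""
    assert x != 0 and g * h == M - 1
    mults = 0
    beta = gf_sqr_n(x, 1)                    # beta = x^2
    # gamma = beta^(1 + 2 + ... + 2^(g-1)), built with g-1 multiplications
    gamma, power = beta, beta
    for _ in range(g - 1):
        power = gf_sqr_n(power, 1)           # next beta^(2^j)
        gamma = gf_mul(gamma, power)
        mults += 1
    # result = gamma^(1 + 2^g + ... + 2^((h-1)g)), built with h-1 multiplications
    result, power = gamma, gamma
    for _ in range(h - 1):
        power = gf_sqr_n(power, g)           # g-fold shift of the previous power
        result = gf_mul(result, power)
        mults += 1
    return result, mults
```

In this sketch the inverse costs (g−1)+(h−1) general multiplications, with all other steps being squarings; for example `gf_inverse(x, 2, 3)` and `gf_inverse(x, 3, 2)` both use 3 multiplications in the toy field.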

Addition of Point P to Itself (P+P) Using the Subroutines

To compute the value of dP for generation of the public key, the arithmetic unit 20 associated with the receiver 12 initially computes the addition of P+P. As noted in the introduction, for a nonsupersingular curve the new point Q has coordinates (X₃, Y₃) where $X_{3} = X_{1}^{2} \oplus \frac{b}{X_{1}^{2}}$ and $Y_{3} = X_{1}^{2} \oplus \left( X_{1} \oplus \frac{Y_{1}}{X_{1}} \right)X_{3} \oplus X_{3}$

To compute X₃, the following steps may be implemented as shown in FIG. 5.

The m bits representing X₁ are loaded into register 42 from base point register 28 and shifted one cell to the right to provide X₁ ². This value is stored in RAM 36 and the inverse of X₁ ² computed as described above.

The value of X₁⁻² is loaded into register 44 and the parameter b extracted from the parameter register 30 and loaded into register 42. The product bX₁⁻² is computed in the accumulating register 46 by rotating the bit vectors and the resultant value XOR'd in adder 54 with the value of X₁² stored in RAM 36 to provide the normal basis representation of X₃. The result may be stored in RAM 36.

A similar procedure can be followed to generate Y₃ by first inverting X₁, multiplying the result by Y₁, and XORing with X₁ in the adder 54. This is then multiplied by X₃ stored in RAM 36 and the result XOR'd with the value of X₃ and X₁² to produce Y₃.

The resultant value of (X₃, Y₃) represents the sum of P+P and is a new point Q on the curve. This could then be added to P to produce a new point Q′. This process could be repeated d−2 times to generate dP.

The addition of P+Q requires the computation of (x₃, y₃) where $x_{3} = \left( \frac{y_{1} \oplus y_{2}}{x_{1} \oplus x_{2}} \right)^{2} \oplus \frac{y_{1} \oplus y_{2}}{x_{1} \oplus x_{2}} \oplus x_{1} \oplus x_{2} \oplus a$ and $y_{3} = \left( \frac{y_{1} \oplus y_{2}}{x_{1} \oplus x_{2}} \right)\left( x_{1} \oplus x_{3} \right) \oplus x_{3} \oplus y_{1}.$

This would be repeated d−2 times with a new value for Q at each iteration to compute dP.

Whilst in principle this is possible with the arithmetic unit 20, in practice the large numbers used make such a procedure infeasible. A more elegant approach is available using the binary representation of the integer d.

Computation of dP From 2P

To avoid adding dissimilar points P and Q, the binary representation of d is used with a doubling method to reduce the number of additions and the complexity of the additions.

The integer d can be expressed as

$d = \sum\limits_{i = 0}^{m}\lambda_{i}2^{i}, \quad \lambda_{i} \in \{ 0,1\}$ and $dP = \sum\limits_{i = 0}^{m}\lambda_{i}\left( 2^{i}P \right)$, i.e. λ_(m)2^(m)P + λ_(m−1)2^(m−1)P + . . . + λ₃2³P + λ₂2²P + λ₁2P + λ₀P

The values of λ are the binary representation of d.

Having computed 2P, the value obtained may be added to itself, as described above at FIG. 5, to obtain 2²P, which in turn can be added to itself to provide 2³P etc. This is repeated until 2^(i)P is obtained for the highest power required.

At each iteration, the value of 2^(i)P is retained in RAM 36 for use in subsequent additions to obtain dP.

The arithmetic unit 20 performs a further set of additions for dissimilar points for those terms where λ is 1 to provide the resultant value of the point (x₃, y₃) representing dP.

If for example k=5, kP can be computed as 2²P+P or 2P+2P+P or Q+Q+P. Therefore the result can be obtained in 3 additions; 2P=Q takes 1 addition, 2P+2P=Q+Q=R takes 1 and R+P takes 1 addition. At most t doublings and t subsequent additions are required depending on how many λ are 1.
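The doubling method can be sketched independently of the curve arithmetic. The following illustration is an assumption-laden stand-in: the group operations are passed in as callbacks (`add`, `double`, both hypothetical names), and the usage below substitutes integers modulo 101 for curve points purely to count operations.

```python
def scalar_multiply(d, point, add, double):
    """Return (dP, doublings, additions) from the binary representation of d >= 1.

    `add(Q, R)` adds dissimilar points and `double(Q)` adds a point to itself;
    both are supplied by the caller.
    """
    result = None            # running sum of the 2^i P terms whose bit is 1
    power = point            # current value of 2^i P
    doublings = additions = 0
    while d:
        if d & 1:
            if result is None:
                result = power
            else:
                result = add(result, power)
                additions += 1
        d >>= 1
        if d:                # another bit remains, so 2^(i+1) P is needed
            power = double(power)
            doublings += 1
    return result, doublings, additions

# Stand-in group for illustration only: integers modulo 101 under addition.
add_mod = lambda q, r: (q + r) % 101
double_mod = lambda q: (2 * q) % 101

five_p = scalar_multiply(5, 7, add_mod, double_mod)
```

With the stand-in group, `five_p` is `(35, 2, 1)`: two doublings and one addition of dissimilar points, matching the three operations counted for k=5 above.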

Performance of Arithmetic Unit 20

For computations in a Galois field F_(2^155) it has been found that computing the inverse takes approximately 3800 clock cycles.

The doubling of a point, i.e. the addition of a point to itself, takes in the order of 4500 clock cycles and for a practical implementation of a private key, the computation of the public key dP may be computed in the order of 1.5×10⁵ clock cycles. With a clock rate typically in the order of 40 MHz, the computation of dP will take in the order of 3×10⁻² seconds. This throughput can be enhanced by bounding the seed key k with a Hamming weight of, for example, 20 and thereby limiting the number of additions of dissimilar points.

Computation of Session Public Key kP and Encryption Key kdP

The session public key kP can similarly be computed with the arithmetic unit 20 of transmitter 10 using the base point P from register 28. Likewise, because the public key dP is represented as a point, (x₃, y₃), the encryption key kdP can be computed in similar fashion.

Each of these operations will take a similar time and can be completed prior to the transmission.

The recipient 12 is similarly required to compute dkP once it has received the ciphertext C, which again will take in the order of 3×10⁻² seconds, well within the time expected for a practical implementation of an encryption unit.

The public key dP and the session key kP are each represented as a 310 bit data string and as such require a significantly reduced bandwidth for transmission. At the same time, the attributes of elliptic curves provide a secure encryption strategy with a practical implementation due to the efficacy of the arithmetic unit 20.

Curve Selection

a) The Selection of the Field F_(q^m)

The above example has utilised the field F_(2^155) and a non-supersingular curve. The value 155 was chosen in part because an optimal normal basis exists in F_(2^155) over F₂. However, a main consideration is the security and efficiency of the encryption system. The value 155 is large enough to be secure but small enough for efficient operation. A consideration of conventional attacks that might be used to break the ciphertext suggests that with elliptic curves over F_(2^m), a value of m of about 130 provides a very secure system. Using one thousand devices in parallel, the time taken to find one logarithm is about 1.5×10¹¹ seconds or at least 1500 years using the best known method and the field F_(2^155). Other techniques produce longer run times.

b) Supersingular v. Nonsupersingular Curves

A comparison of attacks on data encrypted using elliptic curves suggests that non-supersingular curves are more robust than supersingular curves. For a field F_(q^k), an attack based on the method suggested by Menezes, Okamoto and Vanstone in an article entitled “Reducing elliptic curve logarithms to logarithms in a finite field” published in the Proceedings of the 22nd Annual ACM Symposium on Theory of Computing, 1991, pp. 80-89 (the MOV attack), shows that for small values of k, the attack becomes subexponential. Most supersingular curves have small values of k associated with them. In general however, non-supersingular curves have large values of k and provided k>log²q then the MOV attack becomes less efficient than more conventional general attacks.

The use of a supersingular curve is attractive since the doubling of a point (i.e. the case where P=Q) does not require any real time inversions in the underlying field. For a supersingular curve, the coordinates of 2P are $x_{3} = \frac{x_{1}^{4} \oplus b^{2}}{a^{2}}$ and $y_{3} = {{\left( \frac{x_{1}^{2} \oplus b}{a} \right)\left( {x_{1} \oplus x_{3}} \right)} \oplus y_{1} \oplus {a.}}$

Since a is a constant, a⁻¹ and a⁻² are fixed for a given curve and can be precomputed. The values of x₁² and x₁⁴ can be computed with a single and double cyclic shift respectively on the multiplier 48. However, the subsequent addition of dissimilar points to provide the value of dP still requires the computation of an inverse as $x_{3} = \left( \frac{y_{1} \oplus y_{2}}{x_{1} \oplus x_{2}} \right)^{2} \oplus x_{1} \oplus x_{2}$

$y_{3} = \left( \frac{y_{1} \oplus y_{2}}{x_{1} \oplus x_{2}} \right)\left( x_{1} \oplus x_{3} \right) \oplus y_{1} \oplus a$

Accordingly, although supersingular curves lead to efficient implementations, there is a relatively small set of supersingular curves from which to choose, particularly if the encryption is to be robust. For a supersingular curve where m is odd, there are 3 classes of curve that can be considered further, namely

y²+y=x³

y²+y=x³+x

y²+y=x³+x+1

However, a consideration of these curves for the case where m=155 shows that none provide the necessary robustness from attack.

Enhanced security for supersingular curves can be obtained by employing quadratic extensions of the underlying field. In fact, in F_(q) where q=2³¹⁰, i.e. a quadratic extension of F_(2^155), amongst the supersingular curves, there are four which under the MOV attack require computation of discrete logs in F_(2^930). These curves provide the requisite high security and also exhibit a high throughput. Similarly, in other extensions of subfields of F_(2^930) (e.g. F_(2^31)) other curves exist that exhibit the requisite robustness. However, their use increases the digits that define a point and hence the bandwidth when they are transmitted.

By contrast, the number of nonsupersingular curves of F_(q), q=2¹⁵⁵, is 2(2¹⁵⁵−1). By selecting q=2, i.e. a field F_(2^m), the value of a in the representation of the curve, y²+xy=x³+ax²+b, can be chosen to be either 1 or 0 without loss of generality. This large choice of curves permits large numbers of curves over this field to be found for which the order of a curve is divisible by a large prime factor. In general, determining the order of an arbitrary nonsupersingular curve over F_(q) is not trivial and one approach is explained further in a paper entitled “Counting Points on Elliptic Curves” by Menezes, Vanstone and Zuccherato, Mathematics of Computation 1992.

In general however, the selection of suitable curves is well known in the art, as exemplified in “Applications of Finite Fields”, chapters 7 and 8, by Menezes, Blake et al, Kluwer Academic Publishers (ISBN 0-7923-9282-5). Because of the large numbers of such curves that meet the requirements, the use of nonsupersingular curves is preferred despite the added computations.

An alternative approach that reduces the number of inversions when using nonsupersingular curves is to employ homogeneous coordinates. A point P is defined by the coordinates (x₁, y₁, z₁) and Q by the point (x₂, y₂, z₂). The point (0, 1, 0) represents the identity O in E.

To derive the addition formulas for the elliptic curve with this representation, we take points P=(x₁, y₁, z₁) and Q=(x₂, y₂, z₂), normalize each to (x₁/z₁, y₁/z₁, 1), (x₂/z₂, y₂/z₂, 1), and apply the previous addition formulas. If

P=(x₁, y₁, z₁), Q=(x₂, y₂, z₂), P,Q≠O, and P≠−Q, then

P+Q=(x₃, y₃, z₃)

x₃=AD

y₃=CD+A²(Bx₁+Ay₁)z₂

z₃=A³z₁z₂

where A=x₂z₁+x₁z₂, B=y₂z₁+y₁z₂,C=A+B and

D=A² (A+az₁z₂)+z₁z₂BC.

In the case of P=Q, then

x₃=AB

y₃=x₁ ⁴A+B(x₁ ²+y₁z₁+A)

z₃=A³

where A=x₁z₁ and B=bz₁ ⁴+x₁ ⁴.

It will be noted that the computation of x₃, y₃ and z₃ does not require any inversion. However, to derive the coordinates x₃*, y₃* in a nonhomogeneous representation, it is necessary to normalize the representation so that $x_{3}^{*} = \frac{x_{3}}{z_{3}}, \quad y_{3}^{*} = \frac{y_{3}}{z_{3}}$

This operation requires an inversion that utilizes the procedure noted above. However, only one inversion operation is required for the computation of dP.

Using homogeneous coordinates, it is still possible to compute dP using the version of the double and add method described above. The computation of P+Q, P≠Q, requires 13 field multiplications, and 2P requires 7 multiplications.
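The homogeneous formulas can be checked against the affine ones on a small field. The sketch below is illustrative and assumes a toy field F_(2^5) in a polynomial basis, the curve y²+xy=x³+x²+1, and hypothetical helper names. For dissimilar points it uses x₃=AD, y₃=CD+A²(Bx₁+Ay₁)z₂ and z₃=A³z₁z₂, which is the form obtained by normalizing both points and clearing denominators; the doubling case uses the formulas given for P=Q.

```python
M, POLY = 5, 0b100101            # toy field F_(2^5), reduction polynomial x^5 + x^2 + 1
CURVE_A, CURVE_B = 1, 1          # assumed curve y^2 + xy = x^3 + x^2 + 1

def mul(a, b):
    r = 0
    while b:
        if b & 1:
            r ^= a
        b >>= 1
        a <<= 1
        if (a >> M) & 1:
            a ^= POLY
    return r

def inv(a):
    r = 1
    for _ in range(M - 1):
        a = mul(a, a)
        r = mul(r, a)
    return r

def affine_add(p, q):
    """Affine reference; assumes q != -p, and x != 0 when doubling."""
    (x1, y1), (x2, y2) = p, q
    if p == q:
        x1sq = mul(x1, x1)
        x3 = x1sq ^ mul(CURVE_B, inv(x1sq))
        return x3, x1sq ^ mul(x1 ^ mul(y1, inv(x1)), x3) ^ x3
    lam = mul(y1 ^ y2, inv(x1 ^ x2))
    x3 = mul(lam, lam) ^ lam ^ x1 ^ x2 ^ CURVE_A
    return x3, mul(lam, x1 ^ x3) ^ x3 ^ y1

def proj_add(p, q):
    """Inversion-free addition of dissimilar points in homogeneous coordinates."""
    (x1, y1, z1), (x2, y2, z2) = p, q
    A = mul(x2, z1) ^ mul(x1, z2)
    B = mul(y2, z1) ^ mul(y1, z2)
    C = A ^ B
    zz = mul(z1, z2)
    A2 = mul(A, A)
    D = mul(A2, A ^ mul(CURVE_A, zz)) ^ mul(zz, mul(B, C))
    y3 = mul(C, D) ^ mul(mul(A2, mul(B, x1) ^ mul(A, y1)), z2)
    return mul(A, D), y3, mul(mul(A2, A), zz)

def proj_double(p):
    """Inversion-free doubling in homogeneous coordinates."""
    x1, y1, z1 = p
    A = mul(x1, z1)
    x1_2 = mul(x1, x1)
    x1_4 = mul(x1_2, x1_2)
    z1_2 = mul(z1, z1)
    B = mul(CURVE_B, mul(z1_2, z1_2)) ^ x1_4
    y3 = mul(x1_4, A) ^ mul(B, x1_2 ^ mul(y1, z1) ^ A)
    return mul(A, B), y3, mul(A, mul(A, A))

def normalize(p):
    """Return the affine point; this is the single inversion."""
    x, y, z = p
    zi = inv(z)
    return mul(x, zi), mul(y, zi)

def lift(p, z):
    """Represent the affine point p with an arbitrary nonzero z."""
    return mul(p[0], z), mul(p[1], z), z

def on_curve(x, y):
    x2 = mul(x, x)
    return (mul(y, y) ^ mul(x, y)) == (mul(x2, x) ^ mul(CURVE_A, x2) ^ CURVE_B)

POINTS = [(x, y) for x in range(1, 1 << M) for y in range(1 << M) if on_curve(x, y)]
```

In this sketch `normalize(proj_add(lift(P, 3), lift(Q, 5)))` agrees with `affine_add(P, Q)` for every pair of toy-curve points with distinct x-coordinates, so a chain of additions needs only the one inversion in `normalize`.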

Alternative Key Transfer

In the example above, the coordinates of the keys kP and dP are each transferred as two 155 bit field elements for F_(2^155). To reduce the bandwidth further it is possible to transmit only one of the co-ordinates and compute the other coordinate at the receiver. An identifier, for example a single bit of the correct value of the other coordinate, may also be transmitted. This permits the possibilities for the second coordinate to be computed by the recipient and the correct one identified from the identifier.

Referring therefore to FIG. 1, the transmitter 10 initially retrieves as the public key dP of the receiver 12, a bit string representing the coordinate x₀ and a single bit of the coordinate y₀.

The transmitter 10 has the parameters of the curve in register 30 and therefore may use the coordinate x₀ and the curve parameters to obtain possible values of the other coordinate y₀ from the arithmetic unit 20.

For a curve of the form y²+xy=x³+ax²+b and a coordinate x₀, then the possible values y₁, y₂ for y₀ are the roots of the quadratic y²+x₀y=x₀ ³+ax₀ ²+b.

By solving for y in the arithmetic unit 20, two possible roots will be obtained, and comparison with the transmitted bit of information will indicate which of the values is the appropriate value of y.

The two possible values of the second coordinate (y₀) differ by x₀, i.e. y₁=y₂+x₀.

Since the two values of y₀ differ by x₀, y₁ and y₂ will always differ where a “1” occurs in the representation of x₀. Accordingly the additional bit transmitted is selected from one of those positions, and examination of the corresponding bit of the two candidate values of y₀ will indicate which of the two roots is the appropriate value.

The transmitter 10 thus can generate the coordinates of the public key dP even though only 156 bits are retrieved.

Similar efficiencies may be realized in transmitting the session key kP to the receiver 12 as the transmitter 10 need only forward one coordinate, x₀ and the selected identifying bit of y₀. The receiver 12 may then reconstruct the possible values of y₀ and select the appropriate one.

In the field F₂ _(^(m)) it is not possible to solve for y using the quadratic formula as 2a=0. Accordingly, other techniques need to be utilised and the arithmetic unit 20 is particularly adapted to perform this efficiently.

In general, provided x₀ is not zero, if y=x₀z then x₀²z²+x₀²z=x₀³+ax₀²+b.

This may be written as $z^{2} + z = x_{0} + a + \frac{b}{x_{0}^{2}} = c$

i.e. z²+z=c.

If m is odd then either z = c + c⁴ + c¹⁶ + . . . + c^(2^(m−1)) = c^(2^0) + c^(2^2) + c^(2^4) + . . . + c^(2^(m−1))

or z = 1 + c^(2^0) + c^(2^2) + . . . + c^(2^(m−1)) to provide two possible values for y₀.

A similar solution exists for the case where m is even that also utilises terms of the form c^(2^i).

This is particularly suitable for use with a normal basis representation in F_(2^m).

As noted above, raising a field element in F_(2^m) to the power 2^(g) can be achieved by a g fold cyclic shift where the field element is represented in a normal basis.

Accordingly, each value of z can be computed by shifting and adding and the values of y₀ obtained. The correct one of the values is determined by the additional bit transmitted.

The use of a normal basis representation in F_(2^m) therefore simplifies the protocol used to recover the coordinate y₀.
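The recovery of y₀ for odd m can be sketched as follows. This is an illustration under assumptions: a toy field F_(2^5) in a polynomial basis (so the powers of c are computed by repeated squaring rather than by cyclic shifts), the curve y²+xy=x³+x²+1, an identifying bit taken at the lowest position where x₀ has a 1, and hypothetical function names.

```python
M, POLY = 5, 0b100101            # toy field F_(2^5), m odd, polynomial x^5 + x^2 + 1
A, B = 1, 1                      # assumed curve y^2 + xy = x^3 + x^2 + 1

def mul(a, b):
    r = 0
    while b:
        if b & 1:
            r ^= a
        b >>= 1
        a <<= 1
        if (a >> M) & 1:
            a ^= POLY
    return r

def inv(a):
    r = 1
    for _ in range(M - 1):
        a = mul(a, a)
        r = mul(r, a)
    return r

def solve_quadratic(c):
    """Return z = c + c^4 + c^16 + ... + c^(2^(M-1)), a root of z^2 + z = c if one exists."""
    z = t = c
    for _ in range((M - 1) // 2):
        t = mul(t, t)
        t = mul(t, t)            # t is now the next term c^(2^(2i))
        z ^= t
    return z

def identifying_bit(x0, y):
    """Bit of y at the lowest position where x0 has a 1 (the two roots differ there)."""
    pos = (x0 & -x0).bit_length() - 1
    return (y >> pos) & 1

def recover_y(x0, bit):
    """Recover y0 from x0 != 0 and the transmitted identifying bit."""
    x0_inv = inv(x0)
    c = x0 ^ A ^ mul(B, mul(x0_inv, x0_inv))
    z = solve_quadratic(c)
    if (mul(z, z) ^ z) != c:
        raise ValueError("x0 is not the x-coordinate of a point on the curve")
    y = mul(x0, z)               # first root; the second root is y + x0
    return y if identifying_bit(x0, y) == bit else y ^ x0

def on_curve(x, y):
    x2 = mul(x, x)
    return (mul(y, y) ^ mul(x, y)) == (mul(x2, x) ^ mul(A, x2) ^ B)

POINTS = [(x, y) for x in range(1, 1 << M) for y in range(1 << M) if on_curve(x, y)]
```

For each toy-curve point (x₀, y₀) with x₀≠0, `recover_y(x0, identifying_bit(x0, y0))` returns y₀, so only x₀ and one bit need to be transmitted.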

If P=(x₀, y₀) is a point on the elliptic curve E: y²+xy=x³+ax²+b defined over a field F_(2^m), then ȳ₀ is defined to be 0 if x₀=0; if x₀≠0 then ȳ₀ is defined to be the least significant bit of the field element y₀·x₀⁻¹.

The x-coordinate x₀ of P and the bit ȳ₀ are transmitted between the transmitter 10 and receiver 12. Then the y coordinate y₀ can be recovered as follows.

1. If x₀=0 then y₀ is obtained by cyclically shifting the vector representation of the field element b that is stored in parameter register 30 one position to the left. That is, if

b=b_(m−1)b_(m−2) . . . b₁b₀

then y₀=b_(m−2) . . . b₁b₀b_(m−1)

2. If x₀≠0 then do the following:

2.1 Compute the field element c=x₀+a+bx₀⁻² in F_(2^m).

2.2 Let the vector representation of c be c=c_(m−1)c_(m−2) . . . c₁c₀.

2.3 Construct a field element z=z_(m−1)z_(m−2) . . . z₁z₀ by setting

z₀=ȳ₀,

z₁=c₀⊕z₀,

z₂=c₁⊕z₁,

. . .

z_(m−2)=c_(m−3)⊕z_(m−3),

z_(m−1)=c_(m−2)⊕z_(m−2).

2.4 Finally, compute y₀=x₀·z.
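Step 2.3 operates purely on bit vectors and can be sketched without any field arithmetic. The illustration below assumes, consistently with the recurrence z₁=c₀⊕z₀, that squaring in the normal basis moves the coefficient in position i+1 to position i; the function names and the sample vector are hypothetical.

```python
def construct_z(c_bits, z0):
    """Build z from c with z[0] = z0 and z[i+1] = c[i] XOR z[i].

    c_bits[i] holds the coefficient c_i of the normal basis representation of c.
    """
    z = [z0]
    for i in range(len(c_bits) - 1):
        z.append(c_bits[i] ^ z[i])
    return z

def square_plus_self(z_bits):
    """Return the coefficients of z^2 + z under the assumed shift convention."""
    m = len(z_bits)
    return [z_bits[(i + 1) % m] ^ z_bits[i] for i in range(m)]

# Sample c with an even number of 1s, so that z^2 + z = c has solutions.
c = [1, 0, 1, 1, 0, 1, 0]
z_with_0 = construct_z(c, 0)
z_with_1 = construct_z(c, 1)
```

Here `z_with_0` is `[0, 1, 1, 0, 1, 1, 0]` and `z_with_1` is its bitwise complement; both satisfy `square_plus_self(z) == c`, and the transmitted bit ȳ₀ selects between them through z₀.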

It will be noted that x₀⁻¹ can be readily computed in the arithmetic unit 20 as described above and that y₀ can be obtained from the multiplier 48.

In the above examples, the identification of the appropriate value of y₀ has been obtained by transmission of a single bit and a comparison of the values of the roots obtained. However, other indicators may be used to identify the appropriate one of the values and the operation is not restricted to encryption with elliptic curves in the field GF(2^m). For example, if the field is selected as Z_p, p≡3 (mod 4), then the Legendre symbol associated with the appropriate value could be transmitted to designate the appropriate value. Alternatively, the set of elements in Z_p could be subdivided into a pair of subsets with the property that if y is in one subset, then −y is in the other, provided y≠0. An arbitrary value can then be assigned to respective subsets and transmitted with the coordinate x₀ to indicate in which subset the appropriate value of y₀ is located. Accordingly, the appropriate value of y₀ can be determined. Conveniently, it is possible to take an appropriate representation in which the subsets are arranged as intervals to facilitate the identification of the appropriate value of y₀.
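Both indicators for Z_p can be sketched briefly. The example below is an illustration under assumptions not taken from the text: a toy prime p=103 (which satisfies p≡3 mod 4), a curve of the form y²=x³+ax+b with a=2, b=3, and hypothetical function names. Square roots are taken as v^((p+1)/4), which is valid for such primes.

```python
P_MOD = 103                      # toy prime with P_MOD % 4 == 3 (assumption)
A, B = 2, 3                      # assumed curve y^2 = x^3 + A*x + B over Z_p

def legendre(v):
    """Legendre symbol of v modulo P_MOD, returned as -1, 0 or 1."""
    s = pow(v, (P_MOD - 1) // 2, P_MOD)
    return -1 if s == P_MOD - 1 else s

def in_lower_interval(y):
    """Subset indicator: 1 if y lies in the interval [1, (p-1)/2], else 0."""
    return 1 if 1 <= y <= (P_MOD - 1) // 2 else 0

def candidate_roots(x):
    """Return the two candidate values y and -y for the given x."""
    v = (x * x * x + A * x + B) % P_MOD
    r = pow(v, (P_MOD + 1) // 4, P_MOD)
    if r * r % P_MOD != v:
        raise ValueError("x is not the x-coordinate of a point on the curve")
    return r, (P_MOD - r) % P_MOD

def compress(point, indicator):
    """Return x together with the identifying value of y."""
    x, y = point
    return x, indicator(y)

def decompress(x, tag, indicator):
    """Recompute both candidates and keep the one matching the identifying value."""
    y1, y2 = candidate_roots(x)
    return (x, y1) if indicator(y1) == tag else (x, y2)

POINTS = [(x, y) for x in range(P_MOD) for y in range(P_MOD)
          if (y * y - (x * x * x + A * x + B)) % P_MOD == 0]
```

Because −1 is not a square modulo a prime p≡3 (mod 4), exactly one of y and −y has Legendre symbol 1 when y≠0, and exactly one lies in the lower interval, so either `legendre` or `in_lower_interval` can serve as the `indicator` argument.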

These techniques are particularly suitable for encryption utilizing elliptic curves but may also be used with any algebraic curves and have applications in other fields such as error correcting coding where coordinates of points on curves have to be transferred.

It will be seen therefore that by utilising an elliptic curve lying in the finite field GF(2^m) and utilising a normal basis representation, the computations necessary for encryption with elliptic curves may be efficiently performed. Such operations may be implemented in either software or hardware and the structuring of the computations makes the use of a finite field multiplier implemented in hardware particularly efficient.

We claim:
 1. A method of computing an inverse of a number x with a finite field multiplier operating in the finite field GF(2^(m)) and having elements A^(2^i) (0≦i<m) that constitute a normal basis, said multiplier having a pair of m celled recirculating shift registers connected to a m celled recirculating accumulating register to generate in each cell of said accumulating register a respective grouped term of the normal basis representation of the product of a pair of elements located in respective ones of said recirculating shift registers, said method comprising the steps of a) representing the number x as a vector of binary digits x_(i) where x_(i) is the coefficient of A^(2^i) in the normal basis representation of x, b) loading in to each of said shift registers the vector of binary digits x_(i) representing the normal basis representation of x², c) cyclically shifting the binary digits of a first of said registers one cell to provide in said first register a vector representing x⁴, d) rotating said vectors in said shift registers and conjointly rotating said accumulating register with a m fold cyclic shift to generate in the cells of said accumulating register the m grouped terms representing the vector of the product of x² and x⁴, e) loading the vector from the accumulating register to a second of said shift registers, f) repeating the steps of (c), (d), and (e) (g−2) times where g is a factor of m−1 to provide in said accumulating register a vector γ which is the normal basis representation of the exponentiation of $x^{\sum\limits_{j = 0}^{g - 1}2^{j}},$

g) loading the vector representing the normal basis representation of γ in each of said shift registers, h) performing a g-fold cyclic shift of the binary digits of the vector in one of said shift registers where g is a factor of m−1 and g·h=m−1 to provide a vector representing γ^(2^g) in said one register, i) rotating said bit elements in said shift registers and said accumulating register to generate grouped terms of the vector representing the product of γ and γ^(2^g), j) loading the vector from the accumulating register to the other of said shift registers, k) repeating steps h), i), and j) a total of g(h−1)-1 times to provide in said accumulating cell a vector of binary digits of the coefficients of the normal basis representation of the inverse of x.
 2. A method according to claim 1 including the step of loading the vector representing x into one of said registers, performing a 1 cell cyclic shift to provide x² and copying the resultant vector in to the other of said registers.
 3. A method of transferring coordinates of a public key in an elliptic curve cryptosystem from a first correspondent to a second correspondent connected to said first correspondent by a data communications link and having the parameters of the curve, said public key being a point on a non-supersingular elliptic curve, said coordinates including a first coordinate determining two possible points on the curve, each of said possible points having a respective second coordinate, said second coordinates having values differing in at least one but not all bit positions, said method comprising the steps of: a) said first correspondent selecting an identifying bit of said second coordinates, said second coordinates differing at the bit position of said identifying bit; b) said first correspondent using said identifying bit to generate identifying information of one of said second coordinates corresponding to said public key; c) said first correspondent forwarding to said second correspondent said first coordinate of said public key and said identifying information; d) said second correspondent computing said second coordinates from said first coordinate and said elliptic curve; e) said second correspondent using said identifying information to determine the appropriate value of said one of said second coordinates corresponding to said public key.
 4. A method according to claim 3, wherein said elliptic curve is over a binary finite field.
 5. A method according to claim 4, wherein selecting said identifying bit includes selecting a bit position in said first coordinate with a bit having a predetermined value, and generating said identifying information includes determining a corresponding value of the bit in said one of second coordinates at said bit position.
 6. A method according to claim 5, wherein said predetermined value is 1.
 7. A method according to claim 6, wherein said identifying information is equal to said corresponding value.
 8. A method according to claim 7, further comprising the step of said second correspondent comparing said identifying information to the bit at said bit position in at least one of said second coordinates to choose the appropriate value of said one of said second coordinates corresponding to said public key.
 9. A method according to claim 3, wherein said elliptic curve is over a field of prime characteristic.
 10. A method according to claim 9, wherein said identifying information indicates to said second correspondent a subset containing said one of said second coordinates corresponding to said public key and excluding the other of said second coordinates, and for each nonzero element in said subset, the negation of the element is not in said subset.
 11. A method according to claim 10, wherein said identifying information includes a value assigned to said identified subset.
 12. A method according to claim 9, wherein said identifying information indicates to said second correspondent a subset containing said one of said second coordinates corresponding to said public key and excluding the other of said second coordinates, and said identifying information includes a value assigned to said subset.
 13. A method according to claim 12, wherein said subset is an interval.
 14. A method of transferring coordinates of a public key in an elliptic curve cryptosystem from a first correspondent to a second correspondent connected to said first correspondent by a data communications link and having the parameters of the curve, said public key being a point on a non-supersingular elliptic curve, said coordinates including a first coordinate determining two possible points on the curve, each of said possible points having a respective second coordinate, said method comprising the steps of: a) said first correspondent forwarding to said second correspondent said first coordinate of said public key; b) said first correspondent identifying to said second correspondent a subset containing one of said second coordinates corresponding to said public key, and excluding the other of said second coordinates; c) said second correspondent computing said second coordinates from said first coordinate and said curve; d) said second correspondent determining which of said second coordinates is contained in the identified subset to thereby determine the value of said one of said second coordinates corresponding to said public key.
 15. A method according to claim 14 wherein for each nonzero element in one of said subset, the negation of the element is not in said subset.
 16. A method according to claim 15, wherein said subset is an interval.
 17. A method according to claim 16, wherein step (b) includes transmitting a value assigned to said identified subset.
 18. A method according to claim 14, wherein said subset is an interval.
 19. A method according to claim 14, wherein step (b) includes transmitting a value assigned to said identified subset.
 20. A method according to claim 19, wherein said subset is an interval.
 21. A method of transferring coordinates of a public key in an elliptic curve cryptosystem from a first correspondent to a second correspondent connected to said first correspondent by a data communications link and having the parameters of the curve, said public key being a point on a non-supersingular elliptic curve defined over a field F_(p), said coordinates including a first coordinate determining two possible points on the curve, each of said possible points having a respective second coordinate, said method comprising the steps of: a) said first correspondent forwarding to said second correspondent said first coordinate of said public key, together with identifying information of one of said second coordinates corresponding to said public key, said identifying information indicating the Legendre symbol associated with said one of said second coordinates; b) said second correspondent computing said second coordinates from said first coordinate and said elliptic curve; and c) said second correspondent using said identifying information to determine the appropriate value of said one of second coordinates corresponding to said public key.
 22. A method of transferring coordinates of a public key in an elliptic curve cryptosystem from a first correspondent to a second correspondent connected to said first correspondent by a data communications link and having the parameters of the curve, said public key being a point on a non-supersingular elliptic curve, said coordinates including a first coordinate determining two possible points on the curve, each of said possible points having a respective second coordinate, said second coordinates having values differing in at least one but not all bit positions, said method comprising the steps of: a) said first correspondent selecting an identifying bit of said second coordinates, said second coordinates differing at the bit position of said identifying bit; b) said first correspondent using said identifying bit to generate identifying information of one of said second coordinates corresponding to said public key; c) said first correspondent forwarding to said second correspondent said first coordinate of said point and said identifying information; whereby said second correspondent may compute said second coordinates from said first coordinate and said parameters of said elliptic curve and use said identifying information to determine the appropriate value of said one of said second coordinates corresponding to said public key.
 23. A method according to claim 22, wherein said elliptic curve is over a binary finite field.
 24. A method according to claim 23, wherein selecting said identifying bit includes selecting a bit position in said first coordinate with a bit having a predetermined value, and generating said identifying information includes determining a corresponding value of the bit in said one of second coordinates at said bit position.
 25. A method according to claim 24, wherein said predetermined value is 1.
 26. A method according to claim 25, wherein said identifying information is equal to said corresponding value.
 27. A method according to claim 22, wherein said elliptic curve is over a field of prime characteristic.
 28. A method according to claim 27, wherein said identifying information indicates to said second correspondent a subset containing said one of said second coordinates corresponding to said public key and excluding the other of said second coordinates, and for each nonzero element in said subset, the negation of the element is not in said subset.
 29. A method according to claim 28, wherein said identifying information includes a value assigned to said identified subset.
 30. A method according to claim 27, wherein said identifying information indicates to said second correspondent a subset containing said one of said second coordinates corresponding to said public key and excluding the other of said second coordinates, and said identifying information includes a value assigned to said subset.
 31. A method according to claim 30, wherein said subset is an interval.
 32. A method of transferring coordinates of a public key in an elliptic curve cryptosystem from a first correspondent to a second correspondent connected to said first correspondent by a data communications link and having the parameters of the curve, said public key being a point on a non-supersingular elliptic curve, said coordinates including a first coordinate determining two possible points on the curve, each of said possible points having a respective second coordinate, said second coordinates having values differing in at least one but not all bit positions, said method comprising the steps of: a) said second correspondent receiving said first coordinate of said public key and identifying information of one of said second coordinates corresponding to said public key, said identifying information being computed from an identifying bit of said second coordinate, said second coordinates differing at the bit position of said identifying bit; b) said second correspondent computing said second coordinates from said first coordinate and said elliptic curve; c) said second correspondent using said identifying information to determine the appropriate value of said one of said second coordinates corresponding to said public key.
 33. A method according to claim 32, wherein said elliptic curve is over a binary finite field.
 34. A method according to claim 33, said identifying bit being computed by selecting a bit position in said first coordinate with a bit having a predetermined value, and said identifying information being computed by determining a corresponding value of the bit in said one of second coordinates at said bit position.
 35. A method according to claim 34, wherein said predetermined value is 1.
 36. A method according to claim 35, wherein said identifying information is equal to said corresponding value.
 37. A method according to claim 36, further comprising the step of said second correspondent comparing said identifying information to the bit at said bit position in at least one of said second coordinates to choose the appropriate value of said one of said second coordinates corresponding to said public key.
 38. A method according to claim 32, wherein said elliptic curve is over a field of prime characteristic.
 39. A method according to claim 38, wherein said identifying information indicates to said second correspondent a subset containing said one of said second coordinates corresponding to said public key and excluding the other of said second coordinates, and for each nonzero element in said subset, the negation of the element is not in said subset.
 40. A method according to claim 39, wherein said identifying information includes a value assigned to said identified subset.
 41. A method according to claim 38, wherein said identifying information indicates to said second correspondent a subset containing said one of said second coordinates corresponding to said public key and excluding the other of said second coordinates, and said identifying information includes a value assigned to said identified subset.
 42. A method according to claim 41, wherein said subset is an interval.
 43. A method of transferring coordinates of a public key in an elliptic curve cryptosystem from a first correspondent to a second correspondent connected to said first correspondent by a data communications link and having the parameters of the curve, said public key being a point on a non-supersingular elliptic curve, said coordinates including a first coordinate determining two possible points on the curve, each of said possible points having a respective second coordinate, said method comprising the steps of: a) said first correspondent forwarding to said second correspondent said first coordinate of said public key; b) said first correspondent identifying to said second correspondent a subset containing one of said second coordinates corresponding to said public key, and excluding the other of said second coordinates;  whereby said second correspondent may compute said second coordinates from said first coordinate and said curve and determine which of said second coordinates is contained in the identified subset to thereby determine the value of said one of second coordinates corresponding to said public key.
 44. A method according to claim 43 wherein for each nonzero element in said subset, the negation of the element is not in said subset.
 45. A method according to claim 44, wherein said subset is an interval.
 46. A method according to claim 45, wherein step (b) includes transmitting a value assigned to said identified subset.
 47. A method according to claim 43, wherein said subset is an interval.
 48. A method according to claim 43, wherein step (b) includes transmitting a value assigned to said identified subset.
 49. A method according to claim 48, wherein said subset is an interval.
 50. A method of transferring coordinates of a public key in an elliptic curve cryptosystem from a first correspondent to a second correspondent connected to said first correspondent by a data communications link and having the parameters of the curve, said public key being a point on a non-supersingular elliptic curve, said coordinates including a first coordinate determining two possible points on the curve, each of said possible points having a respective second coordinate, said method comprising the steps of: a) said second correspondent receiving from said first correspondent said first coordinate of said public key; b) said second correspondent receiving from said first correspondent an identification of a subset containing one of said second coordinates corresponding to said point, and excluding the other of said second coordinates; c) said second correspondent computing said second coordinates from said first coordinate and said curve; d) said second correspondent determining which of said second coordinates is contained in the identified subset to thereby determine the value of said one of said second coordinates corresponding to said public key.
 51. A method according to claim 50 wherein for each nonzero element in said subset, the negation of the element is not in said subset.
 52. A method according to claim 51, wherein said subset is an interval.
 53. A method according to claim 52, wherein step (b) includes receiving a value assigned to said identified subset.
 54. A method according to claim 50, wherein said subset is an interval.
 55. A method according to claim 50, wherein step (b) includes receiving a value assigned to said identified subset.
 56. A method according to claim 55, wherein said subset is an interval.
 57. A method of transferring coordinates of a public key in an elliptic curve cryptosystem from a first correspondent to a second correspondent connected to said first correspondent by a data communications link and having the parameters of the curve, said public key being a point on a non-supersingular elliptic curve defined over a field F_(p), said coordinates including a first coordinate determining two possible points on the curve, each of said possible points having a respective second coordinate, said method comprising the steps of: a) said first correspondent forwarding to said second correspondent said first coordinate of said public key, together with identifying information of one of said second coordinates corresponding to said public key, said identifying information indicating the Legendre symbol associated with said one of said second coordinates, whereby said second correspondent may compute said second coordinates from said first coordinate and said elliptic curve, and use said identifying information to determine the appropriate value of said one of second coordinates corresponding to said public key.
 58. A method of transferring coordinates of a public key in an elliptic curve cryptosystem from a first correspondent to a second correspondent connected to said first correspondent by a data communications link and having the parameters of the curve, said public key being a point on a non-supersingular elliptic curve defined over a field F_(p), said coordinates including a first coordinate determining two possible points on the curve, each of said possible points having a respective second coordinate, said method comprising the steps of: a) said second correspondent receiving from said first correspondent said first coordinate of said public key together with identifying information of one of said second coordinates corresponding to said public key, said identifying information indicating the Legendre symbol associated with said one of said second coordinates; b) said second correspondent computing said second coordinates from said first coordinate and said elliptic curve; and c) said second correspondent using said identifying information to determine the appropriate value of said one of said second coordinates corresponding to said public key.
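
The subset-based transfer recited in the claims above (a first coordinate plus a value assigned to an interval that contains exactly one of the two candidate second coordinates) may be illustrated by the following minimal sketch. It is not taken from the specification. The toy curve y^2 = x^3 + x + 1 over F_23, the choice of the interval [0, (p-1)/2], the one-bit flag encoding, and the names `compress` and `decompress` are all assumptions made for illustration. The square-root shortcut used here requires p ≡ 3 (mod 4), and a field this small offers no security.

```python
# Illustrative sketch only: toy parameters, hypothetical function names.
P = 23        # assumed toy prime with P % 4 == 3 (enables the square-root shortcut)
A, B = 1, 1   # assumed curve y^2 = x^3 + A*x + B over F_P

HALF = (P - 1) // 2  # upper end of the assumed interval [0, HALF]


def subset_flag(y):
    """Value assigned to the subset: 0 if y lies in [0, HALF], else 1."""
    return 0 if y <= HALF else 1


def compress(point):
    """First correspondent: send the first coordinate and the subset value."""
    x, y = point
    return x, subset_flag(y)


def decompress(x, flag):
    """Second correspondent: recompute both candidates, keep the flagged one."""
    rhs = (x**3 + A * x + B) % P
    y = pow(rhs, (P + 1) // 4, P)  # square root candidate, valid since P % 4 == 3
    if (y * y) % P != rhs:
        raise ValueError("x is not the first coordinate of a point on the curve")
    for candidate in (y, (P - y) % P):  # the two possible second coordinates
        if subset_flag(candidate) == flag:
            return x, candidate
    raise ValueError("flag does not match either second coordinate")
```

With these assumed parameters, (3, 10) and (3, 13) are the two points sharing the first coordinate 3. `compress((3, 10))` yields `(3, 0)` and `decompress(3, 0)` recovers `(3, 10)`, while the flag value 1 selects `(3, 13)`. Because 10 and 13 are negatives of each other modulo 23, only one of them falls in the interval [0, 11], which is the property the dependent claims require of the subset.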